The Expected Profile of Digital Search Trees ∗ December 10 , 2009
نویسندگان
چکیده
A digital search tree (DST) is a fundamental data structure on words that finds myriad of applications from the popular Lempel-Ziv’78 data compression scheme to distributed hash tables. It is a digital tree in which strings (keys, words) are stored directly in (internal) nodes. The profile of a DST measures the number of nodes at the same distance from the root; it is a function of the number of stored strings and the distance from the root. Most parameters of DST (e.g., depth, height, fill-up) can be expressed in terms of the profile. We make here the first step towards deciphering an asymptotic behavior of the DST profile, a long standing open problem in the analysis of algorithms and combinatorics. Throughout we assume that strings stored in DST are generated by a memoryless source. We present a precise analysis of the average profile described by a sophisticated recurrence equation that we solve by analytic methods. This analysis is surprisingly demanding but once it is carried out it reveals unusually intriguing and interesting behavior. The average profile undergoes several phase transitions when moving from the root to the longest path: at first it resembles a full tree until it abruptly starts growing polynomially and oscillating in this range. These results are derived by methods of analytic combinatorics such as generating functions, Mellin transform, Poissonization and de-Poissonization, the saddle-point method, singularity analysis and uniform asymptotic analysis.
منابع مشابه
Probabilistic analysis of the asymmetric digital search trees
In this paper, by applying three functional operators the previous results on the (Poisson) variance of the external profile in digital search trees will be improved. We study the profile built over $n$ binary strings generated by a memoryless source with unequal probabilities of symbols and use a combinatorial approach for studying the Poissonized variance, since the probability distribution o...
متن کاملProfile and Height of Random Binary Search Trees
The purpose of this article is to survey recent results on distributional properties of random binary search trees. In particular we consider the profile and the height.
متن کاملOn the Average Profile of Symmetric Digital
Digital Search Trees (DST) are one of the most popular data structures storing keys, usually represented by strings. The profile of a digital search tree is a parameter that counts the number of nodes at the same distance from the root. It is a function of the number of nodes and the distance from the root. Several, if not all, tree parameters such as height, size, depth, shortest path, and fil...
متن کاملAsymptotic variance of random digital search trees
Asymptotics of the variances of many cost measures in random digital search trees are often notoriously messy and involved to obtain. A new approach is proposed to facilitate such an analysis for several shape parameters on random symmetric digital search trees. Our approach starts from a more careful normalization at the level of Poisson generating functions, which then provides an asymptotica...
متن کاملThe Variance of The Profile in Digital Search Trees
Author(s): R. Kazemi and M. Q. Vahidi-Asl Abstract: What today we call digital search tree (DST) is Coffman and Eve’s sequence tree introduced in 1970. A digital search tree is a binary tree whose ordering of nodes is based on the values of bits in the binary representation of a node’s key. In fact, a digital search tree is a digital tree in which strings (keys, words) are stored directly in in...
متن کامل